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Abstract —In the half-duplex single relay selection network, 
comprised of a source, M half-duplex relays, and a destination, 
only one relay is active at any given time, l.e., only one relay 
receives or transmits, and the other relays are inactive, i.e., they 
do not receive nor transmit. The capacity of this network, when 
all links are affected hy Independent slow time-continuous fading 
and additive white Gaussian noise (AWGN), is still unknown, and 
only achievable average rates have been reported in the literature 
so far. In this paper, we present new achievable average rates for 
this network which are larger than the best known average rates. 
These new average rates are achieved with a buffer-aided relaying 
protocol. Since the developed buffer-aided protocol introduces 
unbounded delay, we also devise a bnffer-aided protocol which 
limits the delay at the expense of a decrease in rate. Moreover, 
we discuss the practical implementation of the proposed buffer- 
aided relaying protocols and show that they do not require 
more resources for channel state information acquisition than 
the existing relay selection protocols. 

Index Terms —Buffer-aided relaying, half-duplex, relay selec¬ 
tion, achievable rate. 


I. Introduction 

C OOPERATIVE communication has recently gained 
much attention due to its ability to increase the through¬ 
put and/or reliability of wireless networks. The basic idea 
behind cooperative communication is that each node can 
act as a relay and help the other nodes of the network to 
forward their information to their respective destination nodes. 
Because of the high complexity inherent to the investigation of 
general cooperative networks, and to get insight into the basic 
challenges and benefits of cooperative communication, re¬ 
searchers have mainly considered relatively simple cooperative 
networks. Although simple, these basic cooperative networks 
reveal the gains that can be accomplished by cooperation 
among network nodes. Moreover, because of their simplicity, 
these basic cooperative networks can be easily integrated into 
the current communication infrastructure. One basic network 
which has shown great potential in terms of utility and 
performance is the half-duplex (HD) single relay selection 
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network proposed in HI. In this network, only one relay is 
active at any given time, i.e., one relay receives or transmits, 
and the other relays are inactive, i.e., they do not receive nor 
transmit. Because of the large achievable performance gains, 
this network has recently attracted considerable interest, see 
Q-im and references therein. Although well investigated, 
the capacity of this network is still unknown when all links 
are affected by independent slow time-continuous fading and 
additive white Gaussian noise (AWGN). So far, only achiev¬ 
able average ratef] have been reported in the literature, see 
m, HD. In fact, to the best of the authors’ knowledge, 
the achievable average rates in noi and HD are the largest 
average rates reported in the literature for this network. These 
rates are based on the relay selection protocol in HI. where, in 
each time slot, the relay with the strongest minimum source- 
to-relay and relay-to-destination channel is selected to forward 
the information from the source to the destination. In this 
paper, we will show that these rates can be surpassed. In 
particular, we develop a buffer-aided relaying protocol which 
achieves average rates which are significantly larger than the 
rates reported in Col and HD- Since the proposed buffer- 
aided protocol introduces unbounded delay, we also devise a 
second buffer-aided protocol which limits the average delay at 
the expense of a decrease in rate. Moreover, we show that the 
proposed buffer-aided relaying protocols do not require more 
resources for channel state information (CSI) acquisition than 
the existing relay selection protocols. 

Buffer-aided HD relaying with adaptive switching between 
reception and transmission was proposed in ifTSll for a sim¬ 
ple three-node relay network without source-destination link. 
Later, buffer-aided relaying was further analyzed in C3 and 
d for adaptive and fixed rate transmission, respectively. 
Buffer-aided relaying protocols were also proposed for two- 
way relaying in C3, ca, the multihop relay network in 
C3, two source and two destination pairs sharing a single 
relay in M, secure communication for two-hop relaying and 
relay selection in HD and 1201, respectively, and amplify-and- 
forward relaying in ED. Eor the considered relay selection 
network, relaying with buffers was investigated in ii and 
191 . However, the protocols in 13 and El are limited to the 
case when all nodes transmit with fixed rates and all source- 
to-relay and relay-to-destination links undergo independent 
and identically distributed (i.i.d.) fading. These protocols were 

* The “average rate” is also referred to as “expected rate” in the literature. 
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developed for improving the outage probability performance 
of the network. In order to use the protocols in JS) and 191 as 
performance benchmarks, we modify them such that all nodes 
transmit with rates equal to their underlying channel capacities. 
However, the modified protocols are still only applicable to 
the case when all links are affected by i.i.d. fading and will 
cause data loss due to buffer overflow for independent non- 
identically distributed (i.n.d.) fading. We note however that 
this drawback is not caused by our modifications since the 
phenomenon of buffer overflow also occurs for the original 
protocols in fS) and ||9l for fixed rate transmission when the 
links of the network are i.n.d. 

This paper is organized as follows. In Section HI] we 
introduce the system model. In Section |III| we present the 
proposed buffer-aided protocol for transmission without delay 
constraints. In Section UVl we discuss the implementation of 
the proposed protocol. In Section [Vj we propose a protocol 
for delay-limited transmission. In Section |Vl| we provide 
numerical examples comparing the achievable rates of the 
proposed protocols and the benchmark protocols. Finally, 
Section lynj concludes the paper. 

II. System Model 

In the following, we introduce the system model of the 
considered relay network. Furthermore, as benchmark scheme, 
we briefly review the conventional non-buffer-aided relay 
selection protocol in m. 

A. System Model 

The HD relay selection network consists of a source S, 
M HD decode-and-forward relays Rk, k = and a 

destination D, as shown in Fig. [T] The source transmits its 
information to the destination only through the relays, i.e., 
because of high attenuation there is no direct link between the 
source and the destination, and therefore, all the information 
that the destination receives is first processed by the relays. 
We assume that the transmission is performed in N time 
slots, where TV —^ oo. The relays in the network are HD 
nodes, i.e., they cannot transmit and receive at the same time. 
Furthermore, in each time slot, only one relay is active, i.e., 
it receives or transmits, and the other relays are inactive, i.e., 
they do not receive nor transmit. Each relay is equipped with 
a buffer of unlimited size in which it stores the information 
that it receives from the source and from which it extracts 
the information that it transmits to the destination. We assume 
that all nodes transmit their codewords with constant power P 
and that the noise at all receivers is independent AWGN with 
variance cr^. We assume transmission with capacity achiev¬ 
ing codes. Hence, the transmitted codewords are Gaussian 
distributed, comprised of n —^ oo symbols, and span one 
time slot. Moreover, we assume that each source-to-relay and 
relay-to-destination channel is affected by independent slow 
time-continuous fading such that the fading remains constant 
during a single time slot and changes from one time slot 
to the next. We assume that the fading is an ergodic and 
stationary random process. Let \hsk{i)\'^ and denote 

the squared amplitudes of the complex channel gains of the 
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source-to-fc-th-relay and fc-th-relay-to-destination channels in 
the i-th time slot, respectively, and let ilsk = 
and rikD = E{\hkD{'i)\'^} denote their mean values, respec¬ 
tively, where E{-} denotes expectation. Then, the signal-to- 
noise ratios (SNRs) of the source-to-fc-th-relay and fc-th-relay- 
to-destination channels are given by 

P P 

lSk{i) = ^\hsk{i)\'^ and = —\hkD{i)\'^ , (1) 

respectively. Furthermore, we denote the average SNRs of the 
source-to-fc-th-relay and fc-th-relay-to-destination channels by 
7Sfc = E{ysk{i)} and jkD = E{ykD{i)}, respectively. Using 
o, the capacities of the source-to-fc-th-relay and /c-th-relay- 
to-destination channels in the T-th time slot, denoted by Csk{i) 
and Ckoii), respectively, are given by 

Csk{i) = log 2 (1 + lsk{i)) (2) 

Ckoii) =\og2 {I+ ykD{i))- (3) 


B. Conventional Relay Selection Protocol 

For comparison purpose, we briefly review the conventional 
non-buffer-aided relay selection protocol im and its corre¬ 
sponding achievable average rate cni, im. 

The conventional relay selection protocol selects the relay k 
with the maximum min{C'sfc(z), C'fc£)(*)} for forwarding the 
information from the source to the destination in the z-th time 
slot m. The channel coding scheme adopted for conventional 
relaying is as follows. In the first half of time slot i, the source 
sends a codeword with rate min{C'sfc(z), CfcD(z)} to the k- 
th relay. The fc-th relay can successfully decode the received 
codeword since the rate of the codeword is smaller than or 
equal to Csk{i)- Then, in the second half of time slot i, the 
relay re-encodes the decoded information and sends it to the 
destination with rate min{C' 5 fe(z), The destination 

can successfully decode the received codeword since the rate 
of the codeword is smaller than or equal to Ckaii)- Hence, the 
overall rate transmitted from source to destination during time 
slot z is i min{Csfc(z), C'fc_D(z)}. Thereby, during N ^ oo 
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time slots, the average rate achieved with conventional relay¬ 
ing, denoted by Rconv, is obtained as Qo), HD 

Rconv = ^E{ maxmin{C'sfe(i), Ckoii)}}- (4) 

In the following, we present the proposed buffer-aided pro¬ 
tocols for the considered relay selection network and the 
corresponding achievable rates. 

III. Buffer-Aided Relaying Protocol without 
Delay Constraint 

In this section, we develop a buffer-aided relaying protocol 
without delay constraints which maximizes the achievable 
average rate for the considered network. To this end, we first 
introduce the instantaneous transmission rates at the nodes in 
each time slot, and then derive the corresponding achievable 
average rate. Next, we maximize the achievable average rate 
and derive analytical expressions for the maximum average 
rate. 


i.e., more than Qk{i — 1). The destination can successfully 
decode this codeword since Rkoii) < Cknii) holds, and 
stores the corresponding information. When the active relay 
transmits, Qk(i) decreases as 

Qfc(*) = Qfc(*-l)--RfcDW. (8) 

In the following, we obtain the average rates of buffer-aided 
single-relay selection. 

B. Average Transmission and Reception Rates 

In order to derive the average rates of buffer-aided single¬ 
relay selection, we first have to model the reception and 
transmission of the fc-th relay. To this end, we introduce two 
binary indicator variables r^{i) € {0,1} and rj{i) £ {0,1}, 
which indicate whether, in the i-th time slot, the fc-th relay 
receives or transmits, respectively. More precisely, r^(i) and 
rj(i) are defined as 


A. Instantaneous Transmission Rates 

In the considered HD single relay selection network, in 
a given time slot, only one relay is selected to receive or 
transmit, i.e., to be active. Without loss of generality, assume 
that the fc-th relay has been selected to be active in the i- 
th time sloQ Then, if the active relay is selected to receive, 
the source maps nRsk{i) bits of information to a Gaussian 
distributed codeword comprised of n —oo symbols, where 
each symbol is generated independently according to a zero- 
mean complex circular-symmetric Gaussian distribution with 
variance P, and transmits this codeword to the selected relay. 
The rate of this codeword Rskii) is set as 

Rsk{i) = Cskii), (5) 


(i) = 


1 if the fc-th relay receives 
0 if the fc-th relay does not receive, 

1 if the fc-th relay transmits 
0 if the fc-th relay does not transmit. 


(9) 

( 10 ) 


Since exactly one relay is active in each time slot, r^(i) and 
rj(i) must satisfy 


M 

-brj(i)] = 1 ,Vl (11) 

k=l 

Using r^(i) and rj(i), the average rates received at and 
transmitted by the fc-th relay, denoted by Rsk and Rko, 
respectively, can be expressed as 


where Csk{i) is the capacity of the source-to-fc-th-relay chan¬ 
nel given in (|2|i. As a result of Q, the active relay can suc¬ 
cessfully decode this codeword and stores the corresponding 
information in its buffer. Let Qk{i) denote the number of 
bits/symbol in the buffer of the fc-th relay at the end of time 
slot i. Then, with this transmission, Qk{i) increases as 

Qk{i) = Qk{i-E) + Csk{i)- (6) 

On the other hand, if the active relay is selected to transmit, 
it extracts nRkD{i) bits of information from its buffer, maps 
it to a Gaussian distributed codeword comprised of n —?► c» 
symbols, where each symbol is generated independently ac¬ 
cording to a zero-mean complex circular-symmetric Gaussian 
distribution with variance P, and transmits it to the destination. 
The rate of this codeword is Rkoii), which is set as 

Rkoii) =min{Qfc(z- l),CkD{i)}, (7) 

where Cknii) is the capacity of the fc-th-relay-to-destination 
channel given in (|2|i. The minimum in the expression for rate 
Rkoii) is a consequence of the fact that the relay cannot 
transmit more information than what it has stored in its buffer. 


Rsk 


RkD 


1 ^ 

lim -j;i^^k{i)Rsk(i) 

N^oo I\ 

1 ^ 

1 ^ 

Jim T7'^r'^{i)RkD{i) 

N—¥00 iV 

2=1 
1 ^ 

J™ l),C'fcr,(i)}. (13) 

N^oo ^' 


Using RkD, Vfc, the average rate received at the destination, 
denoted by Rsd, can be expressed as 


M 

Rsd = ^ RkD 

k=l 


^ N M 

n -1 1 -1 


l),CkD (i)}- 

( 14 ) 


^How exactly the active relay is selected is explained in Theorem 1. 


In the following, our goal is to maximize Rsd- 
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C. Maximization of the Average Rate 

In (fTSl i and (fOl l. the only variables with a degree of freedom 
are r^(i) and rjF(i), Vi, k. Any choice of these variables will 
provide an average rate. However, in order for an average rate 
to be achievable, i.e., for data loss not to occur, the buffers 
at all relays must remain stabl^E Moreover, among all the 
achievable average rates, there exists one rate which is the 
largest. In order to obtain the largest achievable average rate, 
we have to hnd the optimal values of r^(i) and Vi,k, 

which maximize the average rate in (fT4li when constraint (fTTli 
holds and when the buffers at all relays are stable. To this end, 
we introduce the following useful lemma. 

Lemma 1: The achievable average rate is maximized when 
rj?(i) and rj(i), Vi, are chosen such that the following 
condition is satished for all k = 


N 


Af-»-oo A ^' N^oc ' 


Moreover, when (fTSl l holds for the /c-th relay, (fOl l simplihes 
to 


= lim (16) 

N—¥oo I\ 

and when (flSl t holds VA: relays, (O simplifies to 


N M 

= JToo (v)CkD (*). 

i=l k=l 

Proof: Please refer to Appendix lAl 
With Lemma [T] we have reduced the search space for the 
maximum achievable average rate to only those rates for which 
(ITSl l holds Vk. Moreover, we have obtained an expression for 
Rsd which is independent of Qk{i), Vi,k. Now, in order 
to hnd the maximum achievable average rate, we devise a 
maximization problem for the average rate, Rsd, under the 
constraints given in (flSl l and (fTTT i. This maximization problem, 
for A —>■ oo, is given by 



Maximize : 

’’fc 

Subject to : 


= wJ:lir'^{^)CkD{i),Vk 

C2:rR(*)e{0,l}, Vk,i 
C3:rJii) € {0,1}, Vk,i 


(18) 


In (fTSl l. the restrictions in (fTSl l and (fTTTi are rehected in 
constraints Cl and C4, respectively. Fortunately, (HEl can be 
solved analytically. The solution reveals how the values of 
and are to be chosen optimally in each time slot 

i such that the maximum average rate of the buffer-aided 
protocol is achieved. Before providing the solution to (fTsT l. we 
hrst introduce some notations. Let /i^, k = 1,...,M, denote 
constants which are independent of the time slot i and the 


^By a stable buffer we mean that there is no information loss in the buffer 
and the information that enters the buffer eventually leaves the buffer, i.e., no 
information is trapped inside the buffer. 


instantaneous CSI. The values of these constants depend on 
the fading statistics and will be determined later, cf. Lemma |2] 
Then, for a given time slot i, we multiply each Csk{i) with 
/ife and each Ckoii) with (1 — /i^), and collect these products 
in set A{i). Hence, A{i) is given by 

-^(0 = {piCsi{'i),P2Cs2{i),---,PMCsM{i), 

(1 — Pi)CiD{i), (1 — P'2)C2D{i), ■■■, (1 — Mm)C'md(*)}- 

(19) 

We are now ready to present the solution to (fTSl l in the 
following theorem, which represents the proposed protocol for 
transmission without delay constraints. 

Theorem 1: The optimal values of r^{i) and rfii), Vk,i 
which maximize the achievable average rate of the proposed 
protocol are given by 

{ rj(i) = l if (1 -/rfc)C'fc_D(i) = max,4(z) 

r^(i) = 1 if gLkCsk{i) = max,A(i) (20) 

rj(i) = r^{i) = 0 otherwise, 

where the /i^, Vk, are chosen such that constraint Cl in (fTSl) 
is satished Vk. The maximum achievable average rate of the 
proposed protocol is given by (flTl) when r^{i) and rj{i) are 
set as in (l20l) . Vi, k. 

Proof: Please see Appendix iBl ■ 

Remark 1: Theorem [T] reveals that the optimal values of 
r^(i) and r'^(i) depend only on the instantaneous CSI of the 
Lth time slot, and are independent of the instantaneous CSIs 
of past and future time slots. 


D. Analytical Characterization of the Maximum Achievable 
Rate 


By inserting (l20l) into (fT4l i. we obtain the maximum achiev¬ 
able rate of the proposed protocol as an average over N ^ oo 
time slots, which may not be convenient from an analytical 
point of view. Furthermore, Theorem [T] does not provide an 
expression for obtaining constants Vk. In order to obtain 
useful analytical expressions for the maximum achievable 
average rate and constants fik, Vk, we exploit the assumed 
ergodicity and stationarity of the fading, and write (fTSl l (i.e., 
constraint Cl in ( fTSl l) and ( fTTb equivalently as 


L;{iog2(i + rsi(*))} 

L;{iog2(i + rs2(*))} 


£;{iog2(i + riz>(i))} 
F;{iog2(i -f r2u(z))} 


L;{iog2(i + rsM(t))} = E{\og^{i + rMD{i))} 


( 21 ) 


and 


M 


Rsd = y]L;{log2(l + rfez?(t))}, (22) 




respectively, where Tsk{i) = rf{i)jsk{i) and TkD{i) = 
rj(i) 7 fc£)(i), with r^(i) and as in (l20l i. In the following 
two lemmas, we provide simplihed expressions for the max¬ 
imum average rate Rsd and constants Vk. Thereby, we 
drop index i since, due to the stationarity and ergodicity of the 


5 


fading, the statistics of Tskii) and rknii) are independent of 
i. 

Lemma 2: The optimal values of Hk, fc = 1, M, denoted 
by which maximize Rsd, can be obtained by solving the 
following system of M equations 

(J^^og2il+x)frsiix)dx = /o°°log2(l +x)/rio(x)dx 
/o“ log2(l + x)frs2 {x)dx = log2(l + x)/r2o {x)dx 


/o°“log 2 (l +a:)/rsM(a:)c^a: = /“ log 2 (l + x)/r„c (a;)dx, 

(23) 


where, for a; > 0 , 

/rsJa;) = (^{1 + x)^ - l) 

M 

n ((1 + a;)"^ - Ij ((1 +x)~ -Ij, 
i=i 

(24) 

fr^D (x) = (a;)TVs, (^(1 + x) - 1 j 

n (^(1 + a;)^^ - . 

(25) 

Here, {x) and {x) denote the probability density func¬ 
tion (PDF) and cumulative distribution function (CDF) of 7 ^, 
a G {Sk^kD}, respectively. Furthermore, if the fading on 
all source-to-relay and relay-to-destination links is i.i.d., the 
solution to (l23t is /i^ = 1/2, Vfc. 

Proof: Please refer to Appendix ICl ■ 

Remark 2: For i.i.d. links, since fjf. = 1/2, Vfc, the pro¬ 
posed protocol, given by (| 20 ] |. always selects the link with the 
largest instantaneous channel gain among all 2M available 
links for transmission. Hence, for i.i.d. links this protocol 
becomes identical to the protocol proposed in IS). However, 
for i.n.d. links, the protocol in ||8l will cause data loss due to 
buffer overflow. In particular, applying the protocols in and 
0 , the buffers at relays with Llsk > ^kD suffer from overflow 
and receive more information than they can transmit. Hence, a 
fraction of the source’s data is trapped inside the relay buffers 
and does not reach the destination, i.e., data loss occurs. On the 
other hand, our proposed protocol is applicable for all fading 
statistics. 

Lemma 3: The maximum achievable average rate of the 
protocol in Theorem 1 is given by 

^ I'OO 

RgD='^ log 2 (l-f a:)/r^^(x)dx, (26) 

k=i'’^ 

where /p (x) is obtained by inserting ^k = Fk found using 
Lemma into fr^D (®) given by ( |25] |. For i.i.d. fading on 
all links, i.e., when f'^si,{x) = f-y^j^ix) = f-y{x), Vfc, and 


^ 7 s/c(a^) = = F^ix), Vfc, ( l26l l simplifies to 

nOO 

Rsd = M \og2{l + x)f^{x) {F^{x)f’^~^ dx. (27) 
Jo 

Proof: Let us insert the optimal /i^, Vfc, found from 
Lemma into fr^D i^) given in ( |25] ) and denote it by 
/p^^(a;). Eq. (l26l l is obtained by inserting fr^^{x) into (l22li . 
whereas (l27l i is obtained by inserting = l/2 into (|26] | and 
simplifying the resulting expression. ■ 

To get more insight, in the following we investigate the case 
of i.i.d. Rayleigh fading. 


E. Special Case: I.i.d. Rayleigh Fading 

In the following, we simplify the expression for the maxi¬ 
mum average rate in dZTl) for i.i.d. Rayleigh fading. 

The expression fj^,^{x) = 2Mf.y{x) {F^{x)f^~^ in 
can be interpreted as the distribution of the largest 
random variable (RV) among 2M i.i.d. RVs with distributions 
fiski^) = see 123 . For i.i.d. Rayleigh 

fading, i.e., when f-ys^[x) = f-yi,y,{x) = jf, Vfc, where 
7 is the average SNR of all source-to-relay and relay-to- 
destination links, f-y^^^ix) is given as ll 2 ^ 


/ 7 n.»(a:) = 2M ^ (-1)'= i exp (^^{k + 1)^ . 

(28) 

Inserting (|28] | into (l27l) and integrating, we obtain the average 
rate as 


2M-1 


Rsd = M ^ 

k=0 

P - 1 + ^ 
X El 

7 


(2M-l\ (-1) 


V fc y (1 4- fc) ln(2) 


exp 


1 -I- fc 


(29) 


where Eyf) is the first order exponential integral function 
defined as Ei{x) = /{t)dt. On the other hand, for 

the same case, i.e., for i.i.d. Rayleigh fading on all links, the 
achievable rate for conventional relay selection given in (|4]i 
can be written equivalently as ll24l Eq. (26)] 


R 


conv 


M fM - 1\ (-1)'= 

Y k j(l + fc)ln(2) 


exp 


X E-i 


2(1+ fc) 

7 


2(l + fc) \ 

7 J 

(30) 


In order to gain further insight, expressions (|2^ and (l30l) 
can be further simplified for low and high SNRs using the 
following first order Taylor approximations 


exp(c/ 7 )E;i (c/ 7 ) = as 7 -)> 0, (31) 

7 

exp(c/ 7 )L;i (c/ 7 ) = -Kem - ln(c) -f ln( 7 ), as 7 -)■ 00 , 

(32) 


system of nonlinear equations can be solved e.g. by algorithms based where Kem is the Euler-Mascheroni constant and its value is 
on Newton’s method m Kem « 0 . 577 . 
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1) Low SNR: Using ( ISTT i. the rates in ( l29l l and ( l30t can be 
approximated as 


Rsd 


Rc 


7 


2M ^ 

as 7 


21 n(2) ^ k 
M ^ 

» ^ yi 

41n(2) ^ k’ 


0 


as 7 —>• 0. 


Dividing (l33t by (l74l i. we obtain the following ratio 


Rsd 

Rconv 


= 2 


E 2M 

k—1 k 


M 


= 1 k 


(33) 


(34) 


(35) 


For M = 1 and M —oo, the ratio in (l35l l is equal to 3 and 
2 , respectively, which constitute the upper and lower bounds 
of dlSl l for 1 < M < oo. Hence, for low SNRs, the average 
rate of the proposed buffer-aided relay selection protocol is 2 
to 3 times higher than the rate of conventional relay selection. 

2) High SNR: On the other hand, using ( l32l i. the rates in 
and (|30] | can be approximated as 


Rsd 


- M 


1 - K 


EM 


2M-1 

E 

fe =0 


21 n(2) 
2M- 1 
k 


(-l)'=log2(l -b k) 

(1 + fc) 


as 7 —>• oo, 


R. 


7 ~ Kem 


2 ln(2) 

M ^ 

~ Y ^ 

k=0 


M-1 
k 


(-l)Mog2(l + A:) 1 


(1 + fc) 


2 ’ 


as 7 ■ 


(36) 


oo. 


(37) 


Subtracting (IJTT i from (l36l l. we obtain 


Rsd — R 


conv 


1 M - 1\ (-l)felog 2 (l + fc) 

2 2 k J (1 + fc) 



(-I)^l0g2(l + fc) 

(1 + fc) 

(38) 


For M = 1 and M oo, the expression in (l38T l evaluates to 
1 and 1/2, respectively, which constitute the upper and lower 
bounds of ( l38l l for 1 < M < oo. Hence, for high SNRs, 
the average rate of the proposed buffer-aided relay selection 
protocol is between 1 and 1/2 bits/symb larger than the rate 
of conventional relay selection. 

In the following, we discuss the implementation of the 
proposed buffer-aided HD relay selection protocol. 


IV. Implementation of the Proposed Buffer-Aided 
Protocol 

In this section, we discuss the implementation of the pro¬ 
tocol proposed in Theorem [1] The proposed protocol can 
be implemented in a centralized or in a distributed manner. 
A centralized implementation assumes a central node which 
selects the active relay in each time slot and decides whether 


it should receive or transmit. On the other hand, in the dis¬ 
tributed implementation, there is no central node and the relays 
themselves negotiate which relay should be active in each time 
slot. In the following, we discuss both implementations. 


A. Centralized Implementation 

For the centralized implementation, we assume that the 
destination is the central node. Hence, in each time slot, the 
destination has to obtain the CSI of all links. To this end, 
at the beginning of each time slot, the source transmits pilot 
symbols from which all relays acquire their respective source- 
to-relay CSIs. Then, each relay broadcasts orthogonal pilots, 
from which the source and destination learn all source-to-relay 
and relay-to-destination CSIs, respectively. Next, each relay 
feedsbaclJl the CSI of its respective source-to-relay channel 
to the destination. With the acquired CSI, the destination 
computes Csk{i) and CkD{i), Vfc. In order to select the active 
relay according to the protocol in Theorem [T] the destination 
has to construct set A{i), given by (fT^ . This requires the 
computation of the constants Vfc. These constants can 
be computed using Lemma |2] but this requires knowledge of 
the PDFs of the fading gains of all links before the start of 
transmission. Such a priori knowledge may not be available in 
practice. In this case, the destination has to estimate Vfc, in 
real-time using only the CSI knowledge until time slot i. Since 
pLk, Vfc, are actually Lagrange multipliers obtained by solving 
the linear optimization problem in (ISTT i. an accurate estimate 
of pk, Vfc, can be obtained using the gradient descent method 
ll25l . In particular, using Csk{i) and CkD{i), the destination 
recursively computes an estimate of pk, denoted by as 

= Plii - 1) + Ski^)iRlDi^ - 1) - Rlkii - 1)), (39) 

where Rgki't ~ 1) and R%g,(i — 1) are real-time estimates of 
Rsk and RkD, respectively, computed for i > 2 as 

~ 1 ) = ^ ~ 2 ) + ^Skii — 1 ), 

(40) 

RId{^ - 1 ) = - 2 ) + '^^^^^CkD[^ - 1 ), 

1—1 1—1 

(41) 

where 7?|fc(0) and R%g,(Y) are set to zero Vfc. In d^ . 5k{i) is 
an adaptive step size which controls the speed of convergence 
of p%{i) to pLk- In particular, the step size 6k{i) is some 
properly chosen monotonically decaying function of i with 
(5fe(l) < 1, see II 25 I for more details. 

Once the destination has Csk{i), CkD{i), and Vfc, it 

constructs the set A{i), and selects the active relay according 
to Theorem [T] The destination also has to keep track of the 
queue length in the buffers at each relay in each time slot. 
To this end, using Csk{i), CkD{i), (*), and Vfc, the 

destination computes the queue length in the buffers at each 


^This feedback can also be done using pilots. In paiticular, since the 
destination already knows the channel between each relay and itself, each 
relay can broadcast pilots whose amplitude is equal to the channel gain of 
the channel from the source to the selected relay. 
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relay using the following formula 

Qk{i) = - 1) + r^{i)Csk{'i) 

- r-fc (i) miii{Qfe(i - l),C'fcD(i)}. (42) 

Then, the destination broadcasts a control message to the 
relays which contains information regarding which relay is 
selected and whether it will receive or transmit. If the selected 
relay is scheduled to transmit, it extracts information bits 
from its buffer, maps them to a codeword, and transmits the 
codeword to the destination with rate Rkoi}) = Txim{Qk{i — 
l),C'fen(*)}- Otherwise, if the selected relay is scheduled to 
receive, it sends a control message to the source which informs 
the source which relay is selected. Then, the source transmits 
the information codeword intended for the selected relay with 
rate Rsk{i) = Csk{i)- 

The destination may receive the information bits in an order 
which is different from that in which they were transmitted by 
the source. However, using the acquired CSI, the destination 
can keep track of the amount of information received and 
transmitted by each relay in each time slot. This information is 
sufficient for the destination to perform successful reordering 
of the received information bits. 

B. Distributed Implementation 

We now outline the distributed implementation of the pro¬ 
posed protocol using timers, similar to the scheme in HI. 

At the beginning of time slot i, source and destination 
transmit pilots in successive pilot time slots. This enables the 
relays to acquire the CSI of their respective source-to-relay and 
relay-to-destination channels, respectively. Using the acquired 
CSI, the fc-th relay computes C'sfc(i) and Cknii)- Next, using 
Cskii) and Ckni}), the fc-th relay computes the estimate of 
p,k, using (1^, dinil, and (I1T]|. Using Csk{i), Ckoii), 

and the fc-th relay turns on a timer proportional to 

1/max{/r^.(i)C'sfc(*), (1 - /t|(j))C'fc£)(i)}. This procedure is 
performed by all M relays. If 

max{/r|(i)C'sfc(i), (1 - Mfc(i))C'fcr,(*)} = til{i)Csk{i) 

and 

max{/r^.(i)C'sfc(i), (1 - til{i))CkD{i)} = (1 - 

the fc-th relay knows that if it is selected, then it will receive 
and transmit, respectively. The relay whose timer expires first, 
broadcasts a packet containing pilot symbols and a control 
message with information about which relay is selected and 
whether the selected relay receives or transmits. From the 
packet broadcasted by the selected relay, both source and 
destination learn the channels from the selected relay to the 
source and destination, respectively. They also learn which 
relay is selected and whether it is scheduled to receive or 
transmit. If the selected relay is scheduled to transmit, then 
it extracts bits from its buffer, maps them to a codeword and 
transmits the codeword to the destination with rate RkD{i) = 
min{(5fc(* — 1), C'fcu(i)}. Otherwise, if the relay is scheduled 
to receive, then the source transmits to the selected relay a 
codeword with rate Cskii)- 


Again, the destination may receive the information bits in 
an order which is different from that in which they were 
transmitted by the source. Therefore, in order for the desti¬ 
nation to reorder the received information bits, it should keep 
track of the amount of information received and transmitted 
by each relay in each time slot. If the selected relay transmits, 
by successful decoding the destination learns the amount of 
information received. However, when the selected relay is 
scheduled to receive, the relay should feedback the amount 
of information that it received to the destination. Using this 
information, the destination can perform successful reordering 
of the received information bits. 

Remark 3: We note that distributed relay selection proto¬ 
cols based on timers may suffer from long waiting times before 
the first timer expires. Moreover, collisions are possible when 
two or more relay nodes declare that they are the selected 
node at approximately the same time. However, by choosing 
the timers suitably, as proposed in E6i . these negative effects 
can be minimized. 

C. Comparison of the Overhead of the Conventional and the 
Proposed Protocols 

The conventional relay selection protocol reviewed in Sec¬ 
tion III-BI can also be implemented in a centralized or a 
distributed manner. In the following, we discuss the overheads 
entailed by both implementations. 

For the centralized implementation, the destination controls 
the relay selection. To this end, the destination has to acquire 
the CSI of all links in the network. Therefore, for centralized 
implementation, in each time slot, 2M -f 2 pilot symbol 
transmissions are required for CSI acquisition, one control 
packet transmission by the destination is needed to inform 
the relays which relay is selected, and another control packet 
transmission is required for the selected relay to inform the 
source which relay is selected. Moreover, the source has to 
acquire knowledge of min{C'5fc(*)) Cknii)} in order to select 
the rate of transmission. Hence, if mm{Csk{i),CkD{'i)} = 
CkD{i), the selected relay has to feedback the CSI of the 
selected-relay-to-destination link to the source. As a result, 
in total 2M -t- 4 or 2M + 5 pilot symbol, feedback, and 
control packet transmissions are needed in each time slot. 
On the other hand, for the centralized implementation of the 
proposed buffer-aided relaying protocol, also 2M-I-4 or 2M-I-5 
pilot symbol, feedback, and control packet transmissions are 
required. Hence, both the conventional and the proposed 
buffer-aided relaying protocols have identical overheads when 
implemented centrally. 

For conventional relay selection with distributed implemen¬ 
tation, each relay has to acquire the CSI of its source-to- 
relay and relay-to-destination links. To this end, two pilot 
transmissions, one from the source and the other from the 
destination, are needed. Moreover, one packet with pilots and 
a control message from the selected relay are needed to inform 
source and destination which relay is selected, and to allow 
source and destination to learn the CSI of the source-to- 
selected-relay and selected-relay-to-destination links, respec¬ 
tively. Furthermore, assuming relay k is the selected relay in 




time slot i, in order for the source to adapt its transmission rate 
to mm{Csk{'i),CkD{‘i)} and the destination to know which 
codebook to use for decoding in time slot i, both source 
and relay have to know min{C 5 fe(i), Ckaii)}- Acquiring this 
CSI knowladge requires feedback of the source-to-relay or the 
relay-to-destination channel from the relay to the destination 
or the source, respectively. Hence, the distributed implemen¬ 
tation of conventional relay selection requires 4 pilot symbol, 
feedback, and control packet transmissions. On the other hand, 
the distributed implementation of the proposed buffer-aided 
relaying protocol has the same overhead as conventional relay 
selection since it also requires 4 pilot symbol, feedback, and 
control packet transmissions. 

As can be seen from the above discussion, the proposed 
buffer-aided protocol does not require more signaling over¬ 
head than the conventional relay selection protocol. We note, 
however, that the proposed protocol requires the computation 
of and Qk{i), VA:, which are not required for the 

conventional protocols. On the other hand, the computational 
complexity of obtaining ^%{i) and Qk(i) using dS^ - fiTI) and 
(l42l i. respectively, is not high since these equations require 
only one or two additions and one to three multiplications. 

V. Buffer-Aided Relaying Protocol with a Delay 
Constraint 

The protocol in Theorem [T] with the Vfe, obtained 
from Lemma |2] gives the maximum average achievable rate, 
but introduces unbounded delay. To bound the delay, in the 
following, we propose a buffer-aided relaying protocol for 
delay limited transmission. Before presenting the protocol, we 
first determine the average delay for the considered network. 


A. Average Delay 

The average delay for the considered network, denoted by 
T, is specified in the following lemma. 

Lemma 4: The average delay for the considered network is 
given by 


rp _ Qfc 

EtiRsk' 


(43) 


where Rsk is the average rate received at the fc-th relay and 
given by (fT2l i. Furthermore, Qk is the average queue size in 
the buffer of the fc-th relay, which is found as 


*3^ = j™ ^'^Qk{i)- 

N—¥oo iV 

Proof: Please refer to Appendix iDl 
The queue size at time slot i can be obtained using (l42li . 
Due to the recursiveness of the expression in (l42li . it is 
difficult, if not impossible, to obtain an analytical expression 
for the average queue size Qk for a general buffer-aided relay 
selection protocol. Hence, in contrast to the case without delay 
constraint, for the delay limited case, it is very difficult to 
formulate an optimization problem for maximization of the 
average rate subject to some average delay constraint. As a 
result, in the following, we develop a simple heuristic protocol 



for delay limited transmission. The proposed protocol is a 
distributed protocol in the sense that the relays themselves 
negotiate which relay should receive or transmit in each time 
slot such that the average delay constraint is satisfied. We note 
that the proposed protocol does not need any knowledge of the 
statistics of the channels. 

B. Distributed Buffer-Aided Protocol 

Before presenting the proposed heuristic protocol for delay 
limited transmission, we first explain the intuition behind the 
protocol. 

1) Intuition Behind the Protocol: Assume that we have 
a buffer-aided protocol which, when implemented in the 
considered network, enforces the following relation 

S±- = To, Vfc, (45) 

P-Sk 

i.e., the average queue length divided by the average arrival 
rate in the buffer at the fc-th relay is equal to Tq. If (l45l l 
holds Vfc, then by inserting ( l45l l into ( l43l l, we see that the 
average delay of the network will be T = Tq. Moreover, 
enforcing ( l45l l at the fc-th relay requires only local knowledge, 
i.e., only knowledge of the average queue length and the 
average arrival rate at the fc-th relay is required. Hence, this 
protocol can be implemented in a distributed manner. There 
are many ways to enforce (|45] | at the fc-th relay. Our preferred 
method for enforcing (l45T l is to have the fc-th relay receive 
and transmit when Qk{i)/Rsk < To and Qk{i)/Rsk > To 
occur, respectively. Moreover, we prefer a protocol in which 
the more Qk{i)/Rsk differs from Tq, the higher the chance 
of selecting the fc-th relay should be. In this way, Qk{i)/Rsk 
becomes a random process which exhibits fluctuation around 
its mean value Tq, and thereby achieves (l45l l in the long run. 
We are now ready to present the proposed protocol. 

2 ) The Proposed Protocol for Delay-Limited Transmission: 
Let To be the desired average delay constraint of the system. 
At the beginning of time slot i, source and destination transmit 
pilots in successive pilot time slots. This enables the relays to 
acquire the CSI of their respective source-to-relay and relay-to- 
destination channels. Using the acquired CSI, the fc-th relay 
computes Csk{i) and Ckaii)- Next, using Cskii) and the 
amount of normalized information in its buffer, Qk{i— 1), the 
fc-th relay computes a variable Afc(i) as follows 

X^i^) = Xk{i-l)PQkii){To-%^j^^, (46) 

where R%f,,{i — 1) is a real-time estimate of Rsk, computed 
using (l40l i. In (l46l l. Cfc(0 ts the step size function, which is 
some properly chosen monotonically decaying function of i 
with Cfe(l) < 1- Now, using Csk{i), Ckoif), Qk(i - 1), and 
Xk(i), the fc-th relay turns on a timer proportional to 


max{Afc(i)C'sfc(*) , min{(5fc(i - 1), C'fc_D(i)}/Afe(i)} 
This procedure is performed by all M relays. If 

max{Afc(z)C'sfc(i) , min{Qfc(z - l),CkD{i)}/Xk{i)} 
= ^k{'i)Csk{'i) 
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and 

max{Afe(i)C'5fc(i) , min{(3/c(i - 1), C'fe_D(i)}/Afc(i)} 

= miii{Qfc(i - l),C'fen(*)}/Afc(i), (48) 

the fc-th relay knows that if it is selected, then it will receive 
and transmit, respectively. The relay whose timer expires 
first, broadcasts a control packet containing pilot symbols and 
information about which relay is selected and whether the se¬ 
lected relay receives or transmits. From the packet broadcasted 
by the selected relay, both source and destination learn the 
source-to-selected-relay and the selected-relay-to-destination 
channels, respectively, and which relay is selected and whether 
it is scheduled to receive or transmit. If the selected relay is 
scheduled to transmit, then it extracts information from its 
buffer and transmits a codeword to the destination with rate 
Rkoi'i) = niin{Qfc(i — 1), C'fcD(i)}. However, if the relay is 
scheduled to receive, then the source transmits a codeword 
to the fc-th relay with rate = Cski})- In this case, 

the relay has to feedback its source-to-relay channel to the 
destination. This fedback CSI is needed by the destination 
to keep track of the amount of information that each relay 
receives and transmits in each time slot so that the destination 
can perform successful reordering of the received information 
bits. Moreover, exploiting (l42l i. this information is used by the 
destination to compute the queue length in the buffer at each 
relay, Qkii). 

Remark 4: Note that with (|46] | we achieve the aforemen¬ 
tioned goal of increasing the probability of selecting the fc-th 
relay when Qk{i)/Rsk differs more from Tg. More precisely, 
if Qk{i)/Rsk < To, then Afc(i) increases and 1/Afc(i) de¬ 
creases, giving the fc-th relay a higher chance to be selected 
for reception. On the other hand, if Qk{i)/Rsk > To. then 
Afc(i) decreases and l/\k{i) increases, giving the fc-th relay 
a higher chance to be selected for transmission. 

Remark 5: The required overhead of the proposed dis¬ 
tributed delay-limited protocol is identical to the overhead 
of the proposed distributed protocol without delay constraint. 
Furthermore, the delay-limited buffer-aided protocol can also 
be implemented in a centralized manner, similar to the scheme 
in Section ITV-AI The centralized implementation of the delay- 
limited protocol has an overhead identical to the overhead 
of the centralized protocol without delay constraint, see Sec¬ 
tion IIV-AI A summary of the overheads of conventional 
relay selection protocols and the proposed buffer-aided (BA) 
relaying protocols with and without delay constraint is given 
in Table |T] 

VI. Numerical Examples 

We assume that all source-to-relay and relay-to-destination 
links are impaired by Rayleigh fading. Throughout this sec¬ 
tion, we use the abbreviation “BA” to denote “buffer-aided”. 

In Fig. |2] we plot the theoretical maximum average rate 
obtained from Theorem 1, and Lemmas |2] and [3l for M = 5 
relays and i.n.d. fading, where 

[nsi,ns 2 ,ns 3 ,nsi,ns 5 ] = [ 0 . 5 , 1 , 1 . 5 , 2 , 2 . 5 ] 



Fig. 2. Achievable average rates for M = 5, [flgi, f^S3i f^S 4 i ^ss] = 
[0.5, 1, 1.5, 2, 2.5], and [^ 110 ^ 20 ,^ 30 ,^^ 40 ,^ 50 ] = 

[3, 1.3, 0.9, 1.1, 0.7], 



Fig. 3. Estimated /r|(r) and as a function of the time slot i. 

and 

[^}iD,^2D,^3D,^4D,^5d] = [3, 1.3, 0.9, 1.1, 0.7]. 

We have also included simulation results for the proposed 
buffer-aided protocol, where the k = 1,..., 5, are found 

using the recursive method in (|39]l with Sk{i) = Q.l/^/i, Vfc. 
As can be seen, the simulated average rate coincides perfectly 
with the theoretical average rate. As a benchmark, in Fig. |2] 
we show the average rate given in Qol and im. Moreover, we 
have also included the average rates achieved using the delay 
limited BA protocol introduced in Section [V-BI for an average 
delay of T = 5 and T = 10 time slots. For the delay limited 
protocol, in order to evaluate (l46l l we have used Afc(l) = 0.9 
and the step size function Cfe(*) = 0.005/v^/ log 2 (l + F’/cr^), 
Vfc. As can be seen from Fig. |2] both the delay-unlimited and 
the delay-limited BA protocols achieve higher rates than the 
rate achieved in M and im. We note that we cannot use 
the protocols in HI and ||9l as benchmarks in Fig. |2] since 
these protocols are not applicable in i.n.d. fading as the buffers 
would become unstable. In particular, for the protocols in 
il and 13, the buffers at relays with ilgk > ^kD would 
suffer from overflow and receive more information than they 
can transmit. Hence, a fraction of the source’s data would be 
trapped inside the buffers and does not reach the destination, 
i.e., data loss would occur. 

For the parameters adopted in Fig.|2] we show in Fig. |3 the 
corresponding constants pi and ^5 obtained using Lemma |2] 






















TABLE I 

Number of pilot symbol, feedback, and control packet transmissions required for the conventional and the proposed 

BUFFER-AIDED (BA) PROTOCOLS PER TIME SLOT 


10 



Conventional 

BA protocols with and without delay constraint 

Centralized 

2M -I- 4 or 2M -I- 5 

2M + 4 or 2M + 5 

Distributed 

4 

4 




Fig. 4. Average delay until time slot i for Tq = 5 and different 
3r 
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I ^ Protocol in [9] 
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Number of relays 


30 


Fig. 5. Achievable average rates for = Q,kD = V/c, as a function 
of the number of relays M. 


and the corresponding estimated parameters pf (0 and 
obtained using the recursive method in ( |39] ) as functions 
of time for P/a^ = 0 dB. As can be seen from Fig. [3 
the estimated parameters n\{i) and converge relatively 

quickly to and ^ 5 , respectively. 

Furthermore, for the parameters adopted in Fig. |2] we 
have plotted the average delay of the proposed delay-limited 
protocol until time slot i in Fig. 0] for the case when Tq = 5 
time slots, and P/cr^ = 20 dB and P/cr^ = 25 dB. The 
average delay until time slot i is computed based on 143 ) where 
the queue size and the arrival rates are averaged over the time 
window from the hrst time slot to the i-th time slot. Hence, 
for finite i, the average delay until time slot i is the average 
of a random process over a time window of limited duration. 
Because of the assumed ergodicity, for i —?> cxd, the size of 
the averaging window becomes infinite and the time average 
converges to the mean of this random process. However, for 
i < 00 , the time average is still a random process. This is 
the reason for the random fluctuations in the average delay 
until time slot i in Fig. |4] Nevertheless, Fig. |4] shows that the 
average delay until time slot i converges relatively fast to Tq 
as i increases. Moreover, after the average delay has reached 
To, it exhibits relatively small fluctuations around Tq. 


Fig. 6. Achievable average rates for flsk = ^kD = 1, Vfc. vs Pjcr^ for 
different number of relays M, and different delay. 


In Fig. 13 we plot the theoretical achievable average rates 
for BA relaying for i.i.d. fading with ilsk = ^kD = Vfe, 
and P/ct^ = 10 dB, as a function of the number of relays 
M. As can be seen from this numerical example, the growth 
rate of the maximum average rate is inversely proportional to 
M, i.e., the growth rate of the average data rate decreases as 
M increases. In particular, the largest increase in data rate is 
observed when M increases from one to two relays, whereas 
the increase in the maximum average rate when M increases 
from 29 to 30 relays is almost negligible. This behavior can 
be most clearly seen from the expression for the average rate 
for low SNR given in (l33l l. According to (l33 . the average 
rate increases proportionally to 1 -f 1/2 + 1/3 + ... + 1/(2M). 
Therefore, when M is large, adding one more relay to the 
network has a negligible effect on the average rate. As bench¬ 
marks, we also show the average rate given in Qol and m, 
and the average rates achieved with the protocols in || 8 l and 
M- For i.i.d. links, as explained in Remark [T] the protocol in 
ID is identical to the protocol presented in Theorem 1, thereby 
leading to the same rate. 

In Fig. ID we plot the achievable average rate for BA 
relaying without and with a delay constraint, as a function 
of P/cr^, for i.i.d. fading and different numbers of relays M. 
This numerical example shows that, as the number of relays 
increases, the permissible average delay has to be increased 
in order for the rate of the delay constrained protocol to 
approach the rate of the non-delay constrained protocol. More 
precisely, for a single relay network, an average delay of five 
time slots is sufficient for the rate of the delay constrained 
protocol to approach the rate of the non-delay constrained 
protocol. However, for a network with two and four relays, 
the corresponding required delays are 7 and 10 time slots, 
respectively. For comparison, we have also plotted the average 
rate given in Col and Ca, which requires a delay of one time 
slot. Fig. ID shows that the average rate of the buffer-aided 
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relaying protocol with five time slots delay and only one relay in lfT3l Appendix B]. Finally, considering (fT4l i. if (fT^ holds 
surpasses the average rate in Qo) and O for four relays. Vfc, then ([TtT i holds as well. This concludes the proof. 


VII. Conclusion 

We have devised buffer-aided relaying protocols for the 
slow fading HD relay selection network and derived the 
corresponding achievable average rates. We have proposed a 
buffer-aided protocol which maximizes the achievable average 
rate but introduces an unbounded delay, and a buffer-aided 
protocol which bounds the average delay at the expense of a 
decrease in rate. We have shown that the new achievable rates 
are larger than the rates achieved with existing relay selection 
protocols. We have also provided centralized and distributed 
implementations of the proposed buffer-aided protocols, which 
do not cause more signaling overhead than conventional relay 
selection protocols for adaptive rate transmission and do not 
need any a priori knowledge of the statistics of the involved 
channels. 


Appendix 

A. Proof of Lemma Q] 

We denote the left and right hand sides of (fTSl l as Ak and 
Dk, respectively, i.e., 

1 ^ 

= lim (49) 

N^OO I\ 

1 ^ 

N—¥co iV 

2=1 

There are three possible cases for the relationship between Ak 
and Dk, i.e., Ak > Dk, Ak < Dk, and Ak = Dk- If Ak > Dk 
then the buffer of the fc-th relay is receiving more information 
than it transmits. Therefore, the average queue length in the 
buffer grows with time to infinity, and, as a result, Rko = 
Dk, for a proof please refer to ||271 Section 1.5]. Whereas, if 
Ak < Dk, due to the conservation of flow, the buffer cannot 
emit more information than it receives, and therefore RkD = 
Ak- We now prove that for Ak > Dk and Ak < Dk, RkD 
can always be increased by changing the values of r^{i) and 
rj(i). As a result, the only remaining possibility is that Rsd 
is maximized for Ak = Dk- Furthermore, since the achievable 
rate is given by Rsd = RkD, if RkD increases, Rsd 

will also increase. 

Assume first that Ak > Dk- Then, we can always increase 
Dk, and thereby increase RkD, by switching any r^(i) = 1 
for which Qk(i — 1) > 0 holds, from one to zero and, for 
the same i, switch rj (i) from zero to one. On the other hand, 
if Ak < Dk then we can always increase Ak, and thereby 
increase RkD, by switching any randomly chosen = 1 

from one to zero and, for the same i, switch r^(i) from zero to 
one. Now, since Rsd can always be improved when Ak > Dk 
or Ak < Dk, it follows that Rsd is maximized for Ak = Dk- 
Furthermore, when the RkD are maximized Vfc, then Rsd is 
also maximized. Moreover, for Ak = Dk the buffer at the fc-th 
relay is stable since the information that arrives at the buffer 
also leaves the buffer without information loss. On the other 
hand, the proof that (fTbl l holds when (fTSl l is satisfied is given 


B. Proof of Theorem Q] 

To solve (fTsT l. we first relax the binary constraints (f) € 
{ 0 , 1 } and G { 0 , 1 } in (fTST l to 0 < rj,{i) < 1 and 

0 < r^(i) < 1, Vi, respectively. Thereby, we transform the 
original problem ( fTSl l into the following linear optimization 
problem 


Maximize : 

r^(i),r^(i),Vi,k 

Subject to : 


iEZlEZlr■I(^)CkD(^) 

Cl :^EZi rmCsk(^) 

= wEZlrI(^)CkD(^),Vk 

C2 :0 < r^(*) < 1, Vk,i 
C3:0< r^(i) < 1, Vk,i 
C4 :0 < EZ=il^k(^) + rI(i)] < 1, Vk,i. 

(51) 


In the following, we solve the relaxed problem ( fSTT ) and then 
show that the optimal values of and Vi, k are at 

the boundaries, i.e., r^(i) € {0,1} and r'^(i) G {0,1}, Vi,k. 
Therefore, the solution of the relaxed problem (ISTT i is also the 
solution to the original maximization problem in (fTsT l. 

Since (fSTT i is a linear optimization problem, we can solve it 
by using the method of Lagrange multipliers. The Lagrangian 
function for maximization problem (fSTT i is given by 


M N 

^ = E]^E^k(i)CkD(^) 

k — 1 2=1 

M / N N N 

lf^'^k{i)CkD{i) - j^^rl{i)Csk{i) 


/c=l 
M , N 


2=1 
M , N 


" ^ ^ ^ (*) - 1 ) + ^ ^ ^ {i)rl(i) 

k—1 2=1 k—1 2—1 

M N M N 

- Z! ('■fc -1)+ z z (*) 

k—1 2=1 k—1 2=1 

N /M \ 

~ ]v Z^(*) (Zt^fc ^ j 


2=1 

N 


\k^l 
^ M 




(52) 


\k=l 


where ^k/N, a%{i)/N, fi%{i)/N, for x G {R,T}, (j){i)/N, 
and X{i)/N are Lagrange multipliers. These multipliers have 
to satisfy the following conditions. 

1) Dual feasibility condition: The Lagrange multipliers for 
the inequality constraints have to be non-negative, i.e.. 


a^(t) > 0, al{i) > 0, I3f{i) > 0, /3j(f) > 0, 

</<(*)> 0, A(f) > 0, Vi,k. (53) 


have to hold. 

2) Complementary slackness condition: If an inequality is 
inactive, i.e., the optimal solution is in the interior of the 
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corresponding set, the corresponding Lagrange multipliers are 
zero. Therefore, we obtain 

(0 (»'fc - 1) = afc (*) (’’fc - 1) = 0, 

Pkiiyl = 0, /3fc (i)rl = 0, Vi, k 

M \ 

^[r^(i) +rj(i)] - 1 I =0, 
k^l / 

M \ 

^[r^(i)+ r^(i)] I = 0, Vi,fc. 

k=l / 

We now differentiate the Lagrangian function with respect 
to rJJ(i) and r]^(i), for n G {1,M} and m G {1 ,M}, 
and equate the results to zero, respectively. This leads to the 
following two equations 

f^uCsnii) = a^ii) - P^ii) + P{i) - A(*) (57) 

(1 - Hm) CmDii) = aliii) - /3m(*) + </’(*) “ A(i). (58) 

We first show that for the optimal solution of r^{i) and 
rj^(i), 0 < r^{i) < 1 and/or 0 < rj^(i) < 1 cannot 
hold for any n,m G M}, and only r^(i) G {0,1} 

and r^(i) G {0,1} can hold \/n,m = 1,...,M. We prove 
this by contradiction. Assume that 0 < r^(i) < 1 and 
0 < (*)] < 1- Then, according to (l54l) . 

(0 = Pni'^) = 4’i'^) = A(i) = 0 must hold. Inserting this 
into (fSThl. we obtain 




Vi, k (54) 
(55) 


(56) 


must hold. Since (l62T i also cannot hold for an^ arbitrarily 
chosen i, we obtain that 0 < r^(i) < 1 and (0 + 

(i)] = 1 cannot hold. Therefore, the only other possibility 
is that rj}(i) G {0,1} must hold. 

Following the same approach as above, we can also prove 
that r^(i) G {0,1} must hold. Moreover, due to constraint C4 
in dSD, it is clear that if rj}(i) = 1, for any n G {1,..., M}, 
then r{}(i) = 0 for all k = 1, k ^ n, and r^(i) = 0 

for all m = 1,...,M must hold. Similarly, if r^{i) = 1, for 
any m G {1,...,M}, then rj(i) = 0 for all k = 1,...,M, 

k m, and r{}(i) = 0 for all n = 1,..., M must hold. In the 

following, we investigate the conditions under which r{}(i) = 
1 and all other r^(i) = 0 for k = 1, k ^ n, and all 

other r^{i) = 0 for m = 1,..., M. 

Assume r^{i) = 1. Then, r}}(i) = 0 for k = 1,...,M, 
k n, and rj^(i) = 0 for m = must hold. As a 

result, according to (l5^ . aj}(i) > 0, /3{}(i) > 0, P^ii) > 0, 

<^(i) > 1, and /3{}(i) = a^{i) = = A(i) = 0 must hold, 

for fc = 1, k ^ n, and m = 1, Inserting these 

variables in (l57l i. we obtain the following 

finCsnii) = + </>(/), (63) 

^lkCsk(i) = -/3fc (/) + </>(/), V/c n (64) 

[I - ^lm)CmD(i) = -/?^(*)+</<(/), Vm. (65) 

Subtracting (l64l i from (l63t and subtracting dbST l from (l6^ . we 
obtain 


y^nCsnii) = 0. (59) 

Since C's„(i) is an RV, (l5^ can hold only for = 0. 
However, if we assume = 0, and insert /r„ = 0 in (l57b ) 
by setting m = n, we obtain 

CnD{i)=al{i)-Pl{i)- (60) 

Since Cnnii) is a non-negative RV, and since either a^(i) or 
/3j} (i) can be larger than zero but not both, in order for (l60l l to 
hold, Pn{i) must be zero and a^{i) = CnD{i)- On the other 
hand, if /3^ (/) = 0, it would mean that rj} (i) = 1. However, if 
r^(i) = 1 and 0 < < 1 hold jointly, this would violate 

our starting assumption that 0 < (0] < 1 

holds. Hence, 0 < r^(i) < 1 and 0 < + (01 < 

1 cannot hold. 

Now, let us assume that 0 < < 1 and + 

rj(i)] = 1. Since r^(^) < 1, then at least one other variable 
rj}(/) or r^(i) has to be larger than zero but smaller than 
one, where k G {1,..., M}, k n, and m G {1,..., M}. Let 
us assume that this variable is r^{i), where k n. Hence, 
0 < < 1, for k n. Then, according to (l54l i. = 

Pn{P) = ctfe (*) = Pki}) = A(/) = 0, and pp) > 0 must hold. 
Inserting these values in (l57h). we obtain 

Sn{P) — P{P) — l^kCSk{P) • (61) 

However, since Csn{i) and Cskii) are independent RVs, (ImT i 
cannot hold for any arbitrarily chosen i. On the other hand, if 
we assume that instead of the variable which is larger 

than one is rj (i), we would have obtained that 

tJ-nCsn{i) = p{i) = (1 - tJ-k)CkDii) (62) 


fJ-nCsnii) - fJ-kCskii) = + Pk(^), Vk 

( 66 ) 

^J'nCsn{^) - (1 - ^J'7n)CmD{^) = «!{(/) + Pmi'i)^ ^m. (67) 

Since «{}(/) + /?{}(/) > 0 and a^(i) + Pm(^) ^ 0 hold, it 
follows that r^(i) = 1 when the following holds 

Snip) ^ Skip)t Vfc ^ Tl 

AND ^JinCsni^) > (1 - ^im)CmDii), Vm. (68) 

Eq. (l68T l can be written in compact form as 

= 1 if fJ-kCskii) = maxVl(/), (69) 

where set Aii) is defined in ( fT9] l. Following the same approach 
as above, we can prove that 

rj(z) = l if (1 -/rfc)C'fcD(/) = maxVl(/). (70) 

Combining (l6^ and (iTOl i. we obtain (l20l i. This completes the 
proof of Theorem [T] 

C. Proof of Lemma \2\ 

The optimal Hk, Vfc, are found from the system of M 
equations given in (EB. Using the definition of the expected 
value, (ItTIi can be written equivalently as (ItB . where the RVs 
Tsk and Tko are given by 

r = / TSfc if A‘/cC's/c = maxVl 

} 0 if HkCsk y maxVl ’ 

p _ / ^kD if il - Hk)CkD = max A 

} 0 if if - fj.k)CkD y max A. 
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Hence, to find the optimal ^k, Vfc, we only have to find the 
PDFs of Tsk and Tko, frski^) ^"d frkni^)’ ^"d insert them 
into (l23t . In the following, we first derive the PDF of Tsk- 
Using (ItTI i. we can obtain the PDF of Tsk, fvski^)’ for 
X > 0, as 











frskix) = fJskix)Pr{^J■kCsk = max,4}, x > 0, 


(72) 


Fig. 7. Equivalent single buffer model. 


where Pr{-} denotes probability. Note that the distribution of 
f^sk {^) for = 0, is not needed for the computation of the 
expectations in (|2TI) and (l22l i. The only unknown in (|7^ is 
the probability Pr[jikCsk = max.4}. In the following, we 
derive this probability. To this end, we set jsk = x, and obtain 

Prj^fcCsfc = max.4} = Prjp.^ log2(l + x) = max.4} 

M 

= n Pr{4j log 2 (l+ 7Si) < 4/clog2(l+ a;)} 

M 

X < Mfclog2(l + a;)} 

f=i 

= Pr|75j < (1+ x)^ - l| 

M 

X Pr |7jxi < (1 + - l| 

i=i 

= n ((i+^)^-i) 

^ IX 

X (73) 

i=i 

where (x) is the CDF of 7 q,, for a G {Sk, kD}. Inserting 
(|73ll into (l72l i. we obtain (l24l i. Following a similar procedure 
as above, we obtain the distribution of Pko given in (l25l l. 

Now, assume that all source-to-relay and relay-to- 
destination links are i.i.d. Then, /^^^(x) = f-tkoix) = f~/ix) 
holds V/c. Moreover, U^gj,(x) = = F^{x) also holds 

Vfc. As a result, dull and (l25l l can be written for x > 0 as 

frskix) = fj{x)F^ (^(1 + x)r^ - l) 

M 

/ P'k \ / F'k \ 

X n7^7 ((1 + 2;)^ - ij PV ((1 + a;)^ - ij , (74) 

f=i 

j^k 

hkoix) = Mx)F^ (^(1 + x)“^ - 

\ / ^-k-k 

X Jd Fiy ( (1 + x) — 1 j Fly ( (1 + x) — 1 

(75) 

We observe that frskix) and fTko^x) in (|74] | and (fTSl l. re¬ 
spectively, are both functions of and show this explicitly by 
redefining them as frsk{x,ikk) and /r^o(a;,/rfc), respectively. 
Moreover, from (l74l i and (l75l l we observe that 

frkD{x,l^k) = frskix,I- l^k) (76) 


holds. If we now insert ( 176] ) into ( l23T l, we obtain 

log2(l + x)/rsfc(x,/rfc)dx 

log2(l+x)/rsfc(x, 1 -/rfe)dx, Vfc = 1,...,M. (77) 

Now, observe that (fTTl i holds if and only if /ifc = l — fj,k, which 
leads to ^k = \I2. This concludes the proof. 




D. Proof of Lemma 0 

The average delay for a system with M parallel queues is 
well known, and given by li28] Eq. 11.69]. After changing the 
notations in li28] Eq. 11.69] to our notations, we directly obtain 
(l43l l. In the following, we give an alternative, more intuitive 
proof of (i43l l. 

The input-output dynamics at the M buffers in the con¬ 
sidered network during N time slots can be represented 
equivalently by a single buffer model, shown in Eig. |7] The 
different colors in this model correspond to the information 
bits which are received/transmitted by the different relays. Eor 
example, the blue, green, and red colors correspond to the bits 
that are send/received via relay 1, 2, and 3, respectively. In this 
model, the buffer is filled in the same order as the order of the 
packets that arrive at the buffers at the different relays. Which 
packet arrives at the equivalent buffer depends on the position 
of the input switch in each time slot, which on the other 
hand, depends on the values of Vf,fc. The extraction 

of the bits from the equivalent buffer also depends on the 
position of the output switch in each time slot, which on the 
other hand, depends on the values of rj(f), Vi, k. Moreover, 
when the output switch is set to a line with a specific color, 
only bits with that color are extracted from the equivalent 
buffer. Hence, the extraction order is different from the order 
of filling the equivalent buffer. Nevertheless, since the average 
delay computed by Little’s formula 1^ , is independent of the 
order of extracting from the buffer, see ll^ pp. 89-91], for the 
system model in Eig. |7] the average delay T can be computed 
as l29l 


rj, _ Qeq 

^ ~ A ’ 

-^eq 


(78) 


where Qeq is the average queue size of the equivalent buffer 
and Aeq is the average arrival rate of the equivalent buffer. 
Now, using the fact that Qeq = Efcli Qk and Aeq = 
Tf!k=i ffsk, we obtain (l43l l. This concludes the proof. 
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